69 results found.
Speech
Corpus,
Language Type:
Multilingual
Languages:
Croatian
Availability:
Freely Available
License:
LGPL
Size:
382000000 <Not Specified>Production Status:
Newly created-finished
Use:
Language Modelling
-
Paper title:Rapid creation of large-scale corpora and frequency dictionaries
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Attila Zséder | <Not Specified> | None | ||
| Author 2 | Gábor Recski | <Not Specified> | None | ||
| Author 3 | Dániel Varga | BME MOKK | None | ||
| Author 4 | András Kornai | <Not Specified> | None | Hungarian Academy of Sciences | None |
| Main Contact | Gábor Recski | MTA SZTAKI | HU |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Croatian Serbian
Availability:
<Not Specified>
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
<Not Specified>
-
Paper title:An efficient language independent toolkit for complete morphological disambiguation
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | László Laki | Pázmány Péter Catholic University | HU |
| Author 2 | György Orosz | Pázmány Péter Catholic University, Faculty of Information Technology | HU |
| Main Contact | László Laki | Pázmány Péter Catholic University | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Croatian English
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Quality Estimation for Synthetic Parallel Data Generation
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Raphael Rubino | Prompsit Language Engineering | DE |
| Author 2 | Antonio Toral | Dublin City Unversity | NL |
| Author 3 | Nikola Ljubešić | University of Zagreb | SI |
| Author 4 | Gema Ramírez-Sánchez | Prompsit Language Engineering | ES |
| Main Contact | Raphael Rubino | DFKI | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Croatian Serbian
Availability:
Freely Available
License:
CC-BY-SA 3.0
Size:
235952967 words Production Status:
Newly created-finished
Use:
<Not Specified>
-
Paper title:TweetCaT: a tool for building Twitter corpora of smaller languages
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Nikola Ljubešić | University of Zagreb | SI |
| Author 2 | Darja Fišer | University of Ljubljana | SI |
| Author 3 | Tomaž Erjavec | Dept. of Knowledge Technologies, Jožef Stefan Institute | SI |
| Main Contact | Nikola Ljubešić | Jožef Stefan Institute | None |
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
Croatian
Availability:
Freely Available
License:
CreativeCommons
Size:
30000 relations OtherProduction Status:
Newly created-in progress
Use:
Textual Entailment and Paraphrasing
-
Paper title:VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Ivan Sekulić | University of Zagreb, Faculty of Electrical Engineering and Computing, Unska 3, 10000 Zagreb | HR |
| Author 2 | Jan Šnajder | University of Zagreb, Faculty of Electrical Engineering and Computing, Unska 3, 10000 Zagreb | HR |
| Main Contact | Jan Šnajder | University of Zagreb, Faculty of Electrical Engineering and Computing, Unska 3, 10000 Zagreb | None |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Croatian English
Availability:
Freely Available
License:
<Not Specified>
Size:
9387 entries Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Comparing two acquisition systems for automatically building an English–Croatian parallel corpus from multilingual websites
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Miquel Esplà-Gomis | Universitat d'Alacant | ES |
| Author 2 | Filip Klubička | University of Zagreb | HR |
| Author 3 | Nikola Ljubešić | University of Zagreb | SI |
| Author 4 | Sergio Ortiz-Rojas | Prompsit Language Engenering | ES |
| Author 5 | Vassilis Papavassiliou | Institute for Language and Speech Processing / RC Athens | GR |
| Author 6 | Prokopis Prokopidis | Institute for Language and Speech Processing/Athena RC | GR |
| Main Contact | Miquel Esplà-Gomis | Universitat d'Alacant | None |
Documentation:
Documentation in English is publicly available at http://redmine.abumatran.eu/projects/en-hr-tourism-corpus/documents
Written
Lexicon,
Language Type:
Multilingual
Languages:
Croatian
Availability:
Freely Available
License:
CC-BY-NC-SA
Size:
14192 Production Status:
Newly created-in progress
Use:
Language Modelling
-
Paper title:CroDeriV: a new resource for processing Croatian morphology
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Krešimir Šojat | <Not Specified> | None | Faculty of Humanities and Social Sciences, University of Zagreb | HR |
| Author 2 | Matea Srebačić | University of Zagreb | HR | ||
| Author 3 | Marko Tadić | University of Zagreb, Faculty of Humanities and Social Sciences | HR | ||
| Author 4 | Tin Pavelić | University of Zagreb | HR | ||
| Main Contact | Matea Srebačić | University of Zagreb | None |
Documentation:
Complete documentation in English will be available until spring 2014.
Written
Corpus,
Language Type:
Multilingual
Languages:
Croatian Slovenian
Availability:
Freely Available
License:
<Not Specified>
Size:
4222 Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Quality Estimation for Synthetic Parallel Data Generation
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Raphael Rubino | Prompsit Language Engineering | DE |
| Author 2 | Antonio Toral | Dublin City Unversity | NL |
| Author 3 | Nikola Ljubešić | University of Zagreb | SI |
| Author 4 | Gema Ramírez-Sánchez | Prompsit Language Engineering | ES |
| Main Contact | Raphael Rubino | DFKI | None |
Documentation:
<Not Specified>Language Type:
Trilingual
Languages:
Croatian Serbian Slovenian
Availability:
Freely Available
License:
CC-BY-SA-NC
Size:
30 GByte Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
-
Paper title:Corpus-Based Diacritic Restoration for South Slavic Languages
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Nikola Ljubešić | University of Zagreb | HR | ||
| Author 2 | Tomaž Erjavec | Dept. of Knowledge Technologies, Jožef Stefan Institute | SI | ||
| Author 3 | Darja Fišer | University of Ljubljana | SI | ||
| Main Contact | Nikola Ljubešić | Jožef Stefan Institute | None | University of Zagreb | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Croatian Slovenian
Availability:
Freely Available
License:
<Not Specified>
Size:
573 Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Quality Estimation for Synthetic Parallel Data Generation
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Raphael Rubino | Prompsit Language Engineering | DE |
| Author 2 | Antonio Toral | Dublin City Unversity | NL |
| Author 3 | Nikola Ljubešić | University of Zagreb | SI |
| Author 4 | Gema Ramírez-Sánchez | Prompsit Language Engineering | ES |
| Main Contact | Raphael Rubino | DFKI | None |
Documentation:
Available documentation in English




